NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

BI‐LAVA: Biocuration With Hierarchical Image Labelling Through Active Learning and Visual Analytics

https://doi.org/10.1111/cgf.15261

Trelles, Juan; Wentzel, Andrew; Berrios, William; Shatkay, Hagit; Marai, G Elisabeta (February 2025, Computer Graphics Forum)

Abstract In the biomedical domain, taxonomies organize the acquisition modalities of scientific images in hierarchical structures. Such taxonomies leverage large sets of correct image labels and provide essential information about the importance of a scientific publication, which could then be used in biocuration tasks. However, the hierarchical nature of the labels, the overhead of processing images, the absence or incompleteness of labelled data and the expertise required to label this type of data impede the creation of useful datasets for biocuration. From a multi‐year collaboration with biocurators and text‐mining researchers, we derive an iterative visual analytics and active learning (AL) strategy to address these challenges. We implement this strategy in a system called BI‐LAVA—Biocuration with Hierarchical Image Labelling through Active Learning and Visual Analytics. BI‐LAVA leverages a small set of image labels, a hierarchical set of image classifiers and AL to help model builders deal with incomplete ground‐truth labels, target a hierarchical taxonomy of image modalities and classify a large pool of unlabelled images. BI‐LAVA's front end uses custom encodings to represent data distributions, taxonomies, image projections and neighbourhoods of image thumbnails, which help model builders explore an unfamiliar image dataset and taxonomy and correct and generate labels. An evaluation with machine learning practitioners shows that our mixed human–machine approach successfully supports domain experts in understanding the characteristics of classes within the taxonomy, as well as validating and improving data quality in labelled and unlabelled collections.
more » « less
Free, publicly-accessible full text available February 1, 2026
DITTO: A Visual Digital Twin for Interventions and Temporal Treatment Outcomes in Head and Neck Cancer

https://doi.org/10.1109/TVCG.2024.3456160

Wentzel, Andrew; Attia, Serageldin; Zhang, Xinhua; Canahuate, Guadalupe; Fuller, Clifton David; Marai, G Elisabeta (January 2025, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available
Roses Have Thorns: Understanding the Downside of Oncological Care Delivery Through Visual Analytics and Sequential Rule Mining

https://doi.org/10.1109/TVCG.2023.3326939

Floricel, Carla; Wentzel, Andrew; Mohamed, Abdallah; Fuller, CDavid; Canahuate, Guadalupe; Marai, GElisabeta (January 2024, IEEE Transactions on Visualization and Computer Graphics)

Personalized head and neck cancer therapeutics have greatly improved survival rates for patients, but are often leading to understudied long-lasting symptoms which affect quality of life. Sequential rule mining (SRM) is a promising unsupervised machine learning method for predicting longitudinal patterns in temporal data which, however, can output many repetitive patterns that are difficult to interpret without the assistance of visual analytics. We present a data-driven, human-machine analysis visual system developed in collaboration with SRM model builders in cancer symptom research, which facilitates mechanistic knowledge discovery in large scale, multivariate cohort symptom data. Our system supports multivariate predictive modeling of post-treatment symptoms based on during-treatment symptoms. It supports this goal through an SRM, clustering, and aggregation back end, and a custom front end to help develop and tune the predictive models. The system also explains the resulting predictions in the context of therapeutic decisions typical in personalized care delivery. We evaluate the resulting models and system with an interdisciplinary group of modelers and head and neck oncology researchers. The results demonstrate that our system effectively supports clinical and symptom research.
more » « less
Full Text Available
Kiviat Defense: An Empirical Evaluation of Visual Encoding Effectiveness in Multivariate Data Similarity Detection

https://doi.org/10.2352/J.ImagingSci.Technol.2023.67.6.060406

Mantovani, Mirko; Wentzel, Andrew; Trabucco, Juan Trelles; Michaelis, Joseph; Elisabeta_Marai, G (November 2023, Journal of Imaging Science and Technology)

Similarity detection seeks to identify similar, but distinct items over multivariate datasets. Often, similarity cannot be defined computationally, leading to a need for visual analysis, such as in cases with ensemble, computational, patient cohort, or geospatial data. In this work, we empirically evaluate the effectiveness of common visual encodings for multivariate data in the context of visual similarity detection. We conducted a user study with 40 participants to measure similarity detection performance and response time under moderate scale (16 items) and large scale (36 items). Our analysis shows that there are significant differences in performance between encodings, especially as the number of items increases. Surprisingly, we found that juxtaposed star plots outperformed superposed parallel coordinate plots. Furthermore, color-cues significantly improved response time, and attenuated error at larger scales. In contrast to existing guidelines, we found that filled star plots (Kiviats) outperformed other encodings in terms of scalability and error.
more » « less
Full Text Available
A Lens to Pandemic Stay at Home Attitudes

https://doi.org/10.1109/Vis4PandEmRes60343.2023.00008

Wentzel, Andrew; Levine, Lauren; Dhariwal, Vipul; Fatemi, Zahra; Bhattacharya, Abari; Di_Eugenio, Barbara; Rojecki, Andrew; Zheleva, Elena; Marai, G Elisabeta (October 2023, IEEE)

We describe the design process and the challenges we met during a rapid multi-disciplinary pandemic project related to stay-at-home orders and social media moral frames. Unlike our typical design experience, we had to handle a steeper learning curve, emerging and continually changing datasets, as well as under-specified design requirements, persistent low visual literacy, and an extremely fast turnaround for new data ingestion, prototyping, testing and deployment. We describe the lessons learned through this experience.
more » « less
Full Text Available
Spatially-aware clustering improves AJCC-8 risk stratification performance in oropharyngeal carcinomas

https://doi.org/10.1016/j.oraloncology.2023.106460

Canahuate, Guadalupe; Wentzel, Andrew; Mohamed, Abdallah SR; van_Dijk, Lisanne V; Vock, David M; Elgohari, Baher; Elhalawani, Hesham; Fuller, Clifton D; Marai, G Elisabeta (September 2023, Oral Oncology)

Objective: Evaluate the effectiveness of machine learning tools that incorporate spatial information such as disease location and lymph node metastatic patterns-of-spread, for prediction of survival and toxicity in HPV+ oropharyngeal cancer (OPC). Materials & methods: 675 HPV+ OPC patients that were treated at MD Anderson Cancer Center between 2005 and 2013 with curative intent IMRT were retrospectively collected under IRB approval. Risk stratifications incorporating patient radiometric data and lymph node metastasis patterns via an anatomically-adjacent representation with hierarchical clustering were identified. These clusterings were combined into a 3-level patient stratification and included along with other known clinical features in a Cox model for predicting survival outcomes, and logistic regression for toxicity, using independent subsets for training and validation. Results: Four groups were identified and combined into a 3-level stratification. The inclusion of patient stratifications in predictive models for 5-yr Overall survival (OS), 5-year recurrence free survival, (RFS) and Radiation-associated dysphagia (RAD) consistently improved model performance measured using the area under the curve (AUC). Test set AUC improvements over models with clinical covariates, was 9 % for predicting OS, and 18 % for predicting RFS, and 7 % for predicting RAD. For models with both clinical and AJCC covariates, AUC improvement was 7 %, 9 %, and 2 % for OS, RFS, and RAD, respectively. Conclusion: Including data-driven patient stratifications considerably improve prognosis for survival and toxicity outcomes over the performance achieved by clinical staging and clinical covariates alone. These stratifications generalize well to across cohorts, and sufficient information for reproducing these clusters is included.
more » « less
Full Text Available
Multi-organ spatial stratification of 3-D dose distributions improves risk prediction of long-term self-reported severe symptoms in oropharyngeal cancer patients receiving radiotherapy: development of a pre-treatment decision support tool

https://doi.org/10.3389/fonc.2023.1210087

Wentzel, Andrew; Mohamed, Abdallah_S R; Naser, Mohamed A; van_Dijk, Lisanne V; Hutcheson, Katherine; Moreno, Amy M; Fuller, Clifton D; Canahuate, Guadalupe; Marai, G Elisabeta (August 2023, Frontiers in Oncology)

PURPOSE: Identify Oropharyngeal cancer (OPC) patients at high-risk of developing long-term severe radiation-associated symptoms using dose volume histograms for organs-at-risk, via unsupervised clustering. MATERIAL AND METHODS: All patients were treated using radiation therapy for OPC. Dose-volume histograms of organs-at-risk were extracted from patients’ treatment plans. Symptom ratings were collected via the MD Anderson Symptom Inventory (MDASI) given weekly during, and 6 months post-treatment. Drymouth, trouble swallowing, mucus, and vocal dysfunction were selected for analysis in this study. Patient stratifications were obtained by applying Bayesian Mixture Models with three components to patient’s dose histograms for relevant organs. The clusters with the highest total mean doses were translated into dose thresholds using rule mining. Patient stratifications were compared against Tumor staging information using multivariate likelihood ratio tests. Model performance for prediction of moderate/severe symptoms at 6 months was compared against normal tissue complication probability (NTCP) models using cross-validation. RESULTS: A total of 349 patients were included for long-term symptom prediction. High-risk clusters were significantly correlated with outcomes for severe late drymouth (p <.0001, OR = 2.94), swallow (p = .002, OR = 5.13), mucus (p = .001, OR = 3.18), and voice (p = .009, OR = 8.99). Simplified clusters were also correlated with late severe symptoms for drymouth (p <.001, OR = 2.77), swallow (p = .01, OR = 3.63), mucus (p = .01, OR = 2.37), and voice (p <.001, OR = 19.75). Proposed cluster stratifications show better performance than NTCP models for severe drymouth (AUC.598 vs.559, MCC.143 vs.062), swallow (AUC.631 vs.561, MCC.20 vs -.030), mucus (AUC.596 vs.492, MCC.164 vs -.041), and voice (AUC.681 vs.555, MCC.181 vs -.019). Simplified dose thresholds also show better performance than baseline models for predicting late severe ratings for all symptoms. CONCLUSION: Our results show that leveraging the 3-D dose histograms from radiation therapy plan improves stratification of patients according to their risk of experiencing long-term severe radiation associated symptoms, beyond existing NTPC models. Our rule-based method can approximate our stratifications with minimal loss of accuracy and can proactively identify risk factors for radiation-associated toxicity.
more » « less
Full Text Available
Externally validated digital decision support tool for time-to-osteoradionecrosis risk-stratification using right-censored multi-institutional observational cohorts

https://doi.org/10.1016/j.radonc.2025.110890

Humbert-Vidan, Laia; Kamel, Serageldin; Wentzel, Andrew; Kaffey, Zaphanlene; Abdelaal, Moamen; Spier, Kyle B; West, Natalie A; Marai, GElisabeta; Canahuate, Guadalupe; Zhang, Xinhua; et al (June 2025, Radiotherapy and Oncology)

Free, publicly-accessible full text available June 1, 2026
Understanding Stay-at-home Attitudes through Framing Analysis of Tweets

https://doi.org/10.1109/DSAA54385.2022.10032455

Fatemi, Zahra; Bhattacharya, Abari; Wentzel, Andrew; Dhariwal, Vipul; Levine, Lauren; Rojecki, Andrew; Marai, G. Elisabeta; Di Eugenio, Barbara; Zheleva, Elena (October 2022, Proceedings of 2022 IEEE 9th International Conference on Data Science and Advanced Analytics (DSAA))

Full Text Available
THALIS: Human-Machine Analysis of Longitudinal Symptoms in Cancer Therapy

https://doi.org/10.1109/TVCG.2021.3114810

Floricel, Carla; Nipu, Nafiul; Biggs, Mikayla; Wentzel, Andrew; Canahuate, Guadalupe; Van Dijk, Lisanne; Mohamed, Abdallah; Fuller, C.David; Marai, G.Elisabeta (January 2022, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available

« Prev Next »

Search for: All records